Search CORE

507 research outputs found

A novel architecture to classify histopathology images using convolutional neural networks

Author: Castelli Mauro
Kandel Ibrahem
Publication venue: 'MDPI AG'
Publication date: 23/04/2020
Field of study

Kandel, I., & Castelli, M. (2020). A novel architecture to classify histopathology images using convolutional neural networks. Applied Sciences (Switzerland), 10(8), 1-17. [2929]. https://doi.org/10.3390/APP10082929Histopathology is the study of tissue structure under the microscope to determine if the cells are normal or abnormal. Histopathology is a very important exam that is used to determine the patients' treatment plan. The classification of histopathology images is very difficult to even an experienced pathologist, and a second opinion is often needed. Convolutional neural network (CNN), a particular type of deep learning architecture, obtained outstanding results in computer vision tasks like image classification. In this paper, we propose a novel CNN architecture to classify histopathology images. The proposed model consists of 15 convolution layers and two fully connected layers. A comparison between different activation functions was performed to detect the most efficient one, taking into account two different optimizers. To train and evaluate the proposed model, the publicly available PatchCamelyon dataset was used. The dataset consists of 220,000 annotated images for training and 57,000 unannotated images for testing. The proposed model achieved higher performance compared to the state-of-the-art architectures with an AUC of 95.46%.publishersversionpublishe

Multidisciplinary Digital Publishing Institute

Repositório da Universidade Nova de Lisboa

Repository of the University of Ljubljana

Review, challenges, design, and development

Author: Castelli Mauro
Peres Fernando
Publication venue: 'MDPI AG'
Publication date: 01/07/2021
Field of study

Peres, F., & Castelli, M. (2021). Combinatorial optimization problems and metaheuristics: Review, challenges, design, and development. Applied Sciences (Switzerland), 11(14), 1-39. [6449]. https://doi.org/10.3390/app11146449In the past few decades, metaheuristics have demonstrated their suitability in addressing complex problems over different domains. This success drives the scientific community towards the definition of new and better-performing heuristics and results in an increased interest in this research field. Nevertheless, new studies have been focused on developing new algorithms without providing consolidation of the existing knowledge. Furthermore, the absence of rigor and formalism to classify, design, and develop combinatorial optimization problems and metaheuristics represents a challenge to the field’s progress. This study discusses the main concepts and challenges in this area and proposes a formalism to classify, design, and code combinatorial optimization problems and metaheuristics. We believe these contributions may support the progress of the field and increase the maturity of metaheuristics as problem solvers analogous to other machine learning algorithms.publishersversionpublishe

Multidisciplinary Digital Publishing Institute

Directory of Open Access Journals

Repositório da Universidade Nova de Lisboa

Learning the structure of Bayesian Networks: A quantitative assessment of the effect of different algorithmic schemes

Author: Beretta Stefano
Castelli Mauro
Goncalves Ivo
Henriques Roberto
Ramazzotti Daniele
Publication venue
Publication date: 01/01/2018
Field of study

One of the most challenging tasks when adopting Bayesian Networks (BNs) is the one of learning their structure from data. This task is complicated by the huge search space of possible solutions, and by the fact that the problem is NP-hard. Hence, full enumeration of all the possible solutions is not always feasible and approximations are often required. However, to the best of our knowledge, a quantitative analysis of the performance and characteristics of the different heuristics to solve this problem has never been done before. For this reason, in this work, we provide a detailed comparison of many different state-of-the-arts methods for structural learning on simulated data considering both BNs with discrete and continuous variables, and with different rates of noise in the data. In particular, we investigate the performance of different widespread scores and algorithmic approaches proposed for the inference and the statistical pitfalls within them

arXiv.org e-Print Archive

Directory of Open Access Journals

Repositório da Universidade Nova de Lisboa

Estudo Geral

Learning Curves Prediction for a Transformers-Based Model

Author: Castelli Mauro
Cruz Francisco
Publication venue: Ital Publication
Publication date: 01/10/2023
Field of study

One of the main challenges when training or fine-tuning a machine learning model concerns the number of observations necessary to achieve satisfactory performance. While, in general, more training observations result in a better-performing model, collecting more data can be time-consuming, expensive, or even impossible. For this reason, investigating the relationship between the dataset's size and the performance of a machine learning model is fundamental to deciding, with a certain likelihood, the minimum number of observations that are necessary to ensure a satisfactory-performing model is obtained as a result of the training process. The learning curve represents the relationship between the dataset’s size and the performance of the model and is especially useful when choosing a model for a specific task or planning the annotation work of a dataset. Thus, the purpose of this paper is to find the functions that best fit the learning curves of a Transformers-based model (LayoutLM) when fine-tuned to extract information from invoices. Two new datasets of invoices are made available for such a task. Combined with a third dataset already available online, 22 sub-datasets are defined, and their learning curves are plotted based on cross-validation results. The functions are fit using a non-linear least squares technique. The results show that both a bi-asymptotic and a Morgan-Mercer-Flodin function fit the learning curves extremely well. Also, an empirical relation is presented to predict the learning curve from a single parameter that may be easily obtained in the early stage of the annotation process. Doi: 10.28991/ESJ-2023-07-05-03 Full Text: PD

Emerging Science Journal (ESJ)

Repositório da Universidade Nova de Lisboa

Combining Bayesian Approaches and Evolutionary Techniques for the Inference of Breast Cancer Networks

Author: Beretta Stefano
Castelli Mauro
Goncalves Ivo
Merelli Ivan
Ramazzotti Daniele
Publication venue: 'Scitepress'
Publication date: 01/01/2016
Field of study

Gene and protein networks are very important to model complex large-scale systems in molecular biology. Inferring or reverseengineering such networks can be defined as the process of identifying gene/protein interactions from experimental data through computational analysis. However, this task is typically complicated by the enormously large scale of the unknowns in a rather small sample size. Furthermore, when the goal is to study causal relationships within the network, tools capable of overcoming the limitations of correlation networks are required. In this work, we make use of Bayesian Graphical Models to attach this problem and, specifically, we perform a comparative study of different state-of-the-art heuristics, analyzing their performance in inferring the structure of the Bayesian Network from breast cancer data

arXiv.org e-Print Archive

Crossref

Repositório da Universidade Nova de Lisboa

Machine learning applied to banking supervision a literature review

Author: Castelli Mauro
Guerra Pedro
Publication venue: 'MDPI AG'
Publication date: 01/07/2021
Field of study

Guerra, P., & Castelli, M. (2021). Machine learning applied to banking supervision a literature review. Risks, 9(7), 1-24. [136]. https://doi.org/10.3390/risks9070136Machine learning (ML) has revolutionised data analysis over the past decade. Like in-numerous other industries heavily reliant on accurate information, banking supervision stands to benefit greatly from this technological advance. The objective of this review is to provide a compre-hensive walk-through of how the most common ML techniques have been applied to risk assessment in banking, focusing on a supervisory perspective. We searched Google Scholar, Springer Link, and ScienceDirect databases for articles including the search terms “machine learning” and (“bank” or “banking” or “supervision”). No language, date, or Journal filter was applied. Papers were then screened and selected according to their relevance. The final article base consisted of 41 papers and 2 book chapters, 53% of which were published in the top quartile journals in their field. Results are presented in a timeline according to the publication date and categorised by time slots. Credit risk assessment and stress testing are highlighted topics as well as other risk perspectives, with some references to ML application surveys. The most relevant ML techniques encompass k-nearest neigh-bours (KNN), support vector machines (SVM), tree-based models, ensembles, boosting techniques, and artificial neural networks (ANN). Recent trends include developing early warning systems (EWS) for bankruptcy and refining stress testing. One limitation of this study is the paucity of contributions using supervisory data, which justifies the need for additional investigation in this field. However, there is increasing evidence that ML techniques can enhance data analysis and decision making in the banking industry.publishersversionpublishe

Multidisciplinary Digital Publishing Institute

Directory of Open Access Journals

Repositório da Universidade Nova de Lisboa

Special Issue: Generative Models in Artificial Intelligence and Their Applications

Author: Castelli Mauro
Manzoni Luca
Publication venue: 'MDPI AG'
Publication date: 20/04/2022
Field of study

Castelli, M. (Guest ed.), & Manzoni, L. (Guest ed.) (2022). Special Issue: Generative Models in Artificial Intelligence and Their Applications. Applied Sciences (Switzerland), 12(9), [4127]. https://doi.org/10.3390/app12094127In recent years, artificial intelligence has been used to generate a significant amount of high-quality data, such as images, music, and videos. The creation of such a vast amount of synthetic data was made possible due to the improved performance of different machine learning techniques, such as artificial neural networks. Considering the increased interest in this area, new techniques for automatic data generation and augmentation have recently been proposed. For instance, generative adversarial networks (GANs) and their variants are nowadays popular techniques in this research field. The creation of synthetic data was also achieved with evolutionary-based techniques, for instance, in the context of multimedia artifacts creationpublishersversionpublishe

Multidisciplinary Digital Publishing Institute

Repositório da Universidade Nova de Lisboa

Regional variation in the productivity of the English National Health Service

Author: Adriana Castelli
Andrew Street
Chris Bojke
Mauro Laudicella
Padraic Ward
Publication venue
Publication date
Field of study

At a time when there are severe pressures on reducing public spending there is increasing emphasis on determining which parts of the country secure best value for money in the NHS. By linking together large scale and routinely collected datasets we produce and compare productivity estimates across the ten Strategic Health Authorities in England in 2007/08.

Research Papers in Economics

A multiple expression alignment framework for genetic programming

Author: Castelli Mauro
Scott Kristen
Vanneschi Leonardo
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2018
Field of study

Vanneschi, L., Scott, K., & Castelli, M. (2018). A multiple expression alignment framework for genetic programming. In M. Castelli, L. Sekanina, M. Zhang, S. Cagnoni, & P. García-Sánchez (Eds.), Genetic Programming: 21st European Conference, EuroGP 2018, Proceedings, pp. 166-183. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 10781 LNCS). Springer Verlag. DOI: 10.1007/978-3-319-77553-1_11Alignment in the error space is a recent idea to exploit semantic awareness in genetic programming. In a previous contribution, the concepts of optimally aligned and optimally coplanar individuals were introduced, and it was shown that given optimally aligned, or optimally coplanar, individuals, it is possible to construct a globally optimal solution analytically. As a consequence, genetic programming methods, aimed at searching for optimally aligned, or optimally coplanar, individuals were introduced. In this paper, we critically discuss those methods, analyzing their major limitations and we propose new genetic programming systems aimed at overcoming those limitations. The presented experimental results, conducted on four real-life symbolic regression problems, show that the proposed algorithms outperform not only the existing methods based on the concept of alignment in the error space, but also geometric semantic genetic programming and standard genetic programming.authorsversionpublishe

Repositório da Universidade Nova de Lisboa